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A NOTE ON “REGULARITY LEMMA FOR DISTAL 
STRUCTURES” 
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In their recent paper |CS] , Chernikov and Starchenko prove that graphs defined 
in distal theories have the strong Erdds-Hajnal property: if i? C x M” is a 
definable relation in some distal structure M, there is a > 0 such that if A C M"*, 
B C M” are finite sets, then there are Aq Q A, Bq C B with |Ao| > Q!|A|, 
|i3o| > o:\B\ and R D Aq x Bq is either empty or equal to Aq x Bq. In fact, they 
prove a more general statement for hypergraphs and generically stable measures 
instead of counting measure, see Theorem 12.21 below. This generalizes a theorem of 
Alon et al. [APP~*~0^ who proved it if R is semi-algebraic. 

Chernikov and Starchenko’s proof uses both the theory of Keisler measures in 
NIP and some combinatorial arguments. The purpose of this note is to remove 
the latter and give a purely model-theoretic (and in fact quite short) proof of the 
strong Erdds-Hajnal property in distal theories. We also answer a small question 
raised in [CS] about uniformly cutting finite sets and show that this property is 
equivalent to having no generically stable types. We keep this note short and refer 
to the excellent introduction of |CS) for background. 

Thanks to Sergei Starchenko for comments on a preliminary draft. 

1. Generically stable and smooth measures 

We record here some facts about generically stable and smooth measures in NIP 
theories. Everything appeared in previous works, possibly in different terms. See 
e.g. |Siml41 Chapter 7] for details. 

Throughout this note, we assume that our ambient theory T is NIP. Recall that 
a (Keisler) measure p,{x) over some set A is a finitely additive probability measure 
on the boolean algebra of A-definable sets (in free variable x). Such a measure 
extends uniquely to a regular Borel measure on Sx{A). 

Of special importance are measures /r(x) which can be approximated by average 
measures on finite sets. Such measures are called generically stable. Here is one 
way to define them, where Av(ai,... ,an]X) means ^\{i : Oi S Y}|. 

Definition 1.1. A measure p,{x) over a model M is generically stable if for ev¬ 
ery formula (j){x; y) and e, there are oi,..., a„ G M such that for any b G 
\y{(l){x\ b)) - Av(oi, ...,an; (pix; 6))| < e. 

In particular, the normalized counting measure on a finite set {oi,...,a„} is 
generically stable. The VC-theorem implies that the number n above depends only 
on (j){x; y) and e: 
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Fact 1.2. Let (j){x;y) be a formula and e > 0, then there is an integer n such that 
for any generically stable measure y(x) on a model M, they are ai,...,an G M 
such that for any b G |/i((/)(x; b)) — Av(ai,..., a„; (j){x; &))| < e. 

In fact, the number n depends only on the VC-dimension of (j) (and e). 

Corollary 1.3. Let L be an index set and for i & I, fj,i{x) a generically stable 
measure on a model Mi. For lA an ultrafilter on L, the ultraproduet riw is a 
generically stable measure on 

Proof. Let y = T\u pti and fix a formula 4>{x]y) and e > 0. For each i, take 
ai(*),..., Onii) in Mi given by Fact ll.2l for yi. Define dk ■= [ak{i) : i G /] G Hw 
Then for any parameter b, we have \y{(j){x; b)) — Av(di,..., dk] f’i.x] &))| < e. Hence 
y is generically stable. □ 

A measure y{x) over M is called smooth if it has a unique extension to any 
bigger model N. We have the following equivalent condition (which follows easily 
by compactness, see |Siml41 Lemma 7.8]). 

Fact 1.4. A measure y{x) over M is smooth if and only if for every formula 4>{x] y) 
and e > 0, there are finitely many formulas ifi^y) and dl(x), 0f(x) all over M such 
that: 

(1) the formulas ipiiy) partition y-space; 

(2) for all i and b G if M \= 'ipi{b), then 

M h (l){x]b) e1{x))] 

(3) for all i, y{0‘i{x)) - y{9](,x)) < e. 

Let y{x) and v{y) be two measures over the same base M. We say that a 
measure uj{x, y) over M is a product measure of y(x) and Xijj) if u}{4>{x) A '!/'(?/)) = 
y{(j){x)) ■ X{ip{y)) for any two definable sets (j){x) and 'ip{y). We let {y x X){x,y) 
denote the partial measure defined on the boolean algebra generated by rectangles 
(l){x) A ifiy) and giving such a set the product measure y{4>{x)) • X{'ip(y)) (this 
determines the measure on the boolean algebra since a boolean combination of 
rectangles can be written as a disjoint union of rectangles). Note that if X{y) is a 
type, then any extension of y{x) U X{y) is a product measure. 

We say that two measures y{x) and A(y) over M are weakly orthogonal if there 
is a unique product measure of y{x) and X{y). 

Lemma 1.5. Two measures y{x) and X{y) over M are weakly orthogonal if and 
only if the following property holds: 

For any M-definable set D(x,y) and e > 0, there are two definable sets D~(x,y) 
and Df (x, y) such that: 

(i) Dt and Df are union of rectangles Ai(x) x Bi(y); 

(ii) Dt CD C D+; 

{Hi) {y X X){Df) — (y X X){Dt) < e. 

Proof. Right to left: the conditions imply that any product measure must give D 
measure infe>o(/r x X){Df) = supj<g(^ x X){D~). Hence there is a unique such 
measure. 

For the converse, we use Lemma 7.3 in [Siml4) . It says that given a boolean 
algebra Ll of definable sets and a partial measure yo on H, we can extend yo to a 
Keisler measure y and additionally impose y{D) = r for some definable set D and 
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real r G [0,1] as long as there is no obvious obstruction, namely any B € fl, B C D 
has measure < r and any B € H., D C B has measure > r. We apply this here with 
V, being the algebra generated by rectangles. □ 

Lemma 1.6. Let fj.{x) be a measure over M. Then is smooth if and only if 
it is weakly orthogonal to all measures over M. 

Proof. By definition, a measure is smooth if and only if it is weakly orthogonal to 
all types over M. Hence right to left is fine. 

Left to right is |HPS12[ Corollary 2.5] and follows easily from Fact ll.41 a formula 
4>{x',y) is well approximated from below by the union of rectangles P{x,y) := 
\JiO°{x) A ipiiy) and from above by Q{x,y) := \J^0l{x) A 'ipi{y). If uj(x,y) is a 
product measure of y{x) and some A(y), then uj(Q) — uj{P) cannot be more than 
e. □ 

If ... ,Xn) is a relation and sets Ai C are given, we say that the 

tuple (Hi,..., An) is i?-homogeneous if either for all (aii,..., x„) G Hi x • • • x H„, 
R{xi,... ,Xn) holds or for all (aii,... ,Xn) G Hi x • • • x H„, ^R{xi,... ,x„) holds. 

Corollary 1.7. Let y,{x) be a smooth measure and A(j/) any Keisler measure. Let 
R{x]y) be a formula over M and e > 0. Then there is an M-definable partition 
Pi,... ,Pn of x-space and Qi,..., Qm of y-space such that 

'^y{P,)X{Qj) < e, 

where the sum runs over all pairs {i,j) such that {Pi,Qj) is not R-homogenous. 

Proof. By the previous lemmas, there are two definable sets P,Q C which 

are union of rectangles Ai(x) x Bi{y) such that: 

1. P CRCQ- 

2. uj{Q) — uj{P) < e, for any product measure uj{x,y) of /r(x) and X{y). 

We then obtain the partition {Pi)i by taking all the atoms in the boolean algebra 
generated by the H^’s and same for {Qj)j and the Bfs. □ 

This generalizes immediately to a product of n-many smooth measures (or indeed 
n — 1 smooth measures and an arbitrary one). 

Corollary 1.8. Let /xi(xi),...,/i„(x„) be smooth measures over M. Fix a for¬ 
mula R{xi,... ,Xn) and e > 0. Then for each i, there is an M-definable partition 
P(i, 1),..., P(i, m(i)) of Xi-space such that 

^ ' /^l(L*(l) ii)) ' ■ ' yn{.PijL, in)) < C, 

where the sum runs over all tuples (ii,..., in) for which the family (^*(1, *i), • ■ ■, Pin, t 
is not R-homogeneous. 

2. DISTALITY AND THE MAIN RESULT 

The class of distal theories was introduced in [Siml3) to capture the notion of an 
order-like, or purely unstable, NIP theory. There are various equivalent definitions 
of distality in terms of indiscernible sequences, invariant types, measures, arbitrary 
types... We only need to know here that a theory is distal if and only if all generically 
stable measures are smooth. Thus we have: 


Lemma 2.1. Let T be distal, then an ultraproduct of smooth measures is smooth. 
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Proof. This follows immediately from Corollary 11.31 and the fact that generically 
stable measures and smooth measures coincide. □ 

We now have all we need to prove the main theorem. The following is Corollary 
4.6 in |CS) . which is the final statement in Sections 3 and 4 of that paper. We prove 
the version with parameters directly, although as observed in the proof of Corollary 
4.6, it would follow at once from the simpler parameter-free version. 

Theorem 2.2. Let T he distal, Ry(xi ,..., Xd) = R(xi ,..., Xd] y) a definable rela¬ 
tion. Given a > 0, there is S > 0 such that: for any parameter b, for any generically 
stable (equiv. smooth) product measure lo on x • • • x with uj{Rb) > a, 

there are definable sets Ai C with Lo\xi{Ai) > 5 for all i such that Ai C R^. 

Moreover, each Ai is defined by an instance of a formula that depends only on 
R and a. 

Proof. This is an direct consequence of Corollary 11.71 and Lemma 12.11 We give 
details. First, by Corollarv ll.81 given uj and Rb, we can find sets Ai C such 

that Y\AiC Rb and uj\xi{Ai) > 6 for some <5 > 0 depending on all the data. 

Now assume that the conclusion is false. For simplicity of notations, assume L is 
countable and list all tuples of the form {0i{xi;yi ),..., Od{xd; yd)) as {9j : j < to), 
where 9j = {9l{xi]yij),... ,9)^{xd',yd,j))- Given an integer n, there is a smooth 
measure u = uj(n) over a model M = M{n) and a parameter b — b{n) g M[n) 
such that u}(Rb) > a and for any j < n and any choice of parameters hij from M, 
setting Ai = 9j {M-,bij), either for some i, uilxfiAi) < 1/n, or it is not the case that 
n^A^cRb. 

Take a non-principal ultraproduct of the uj{n) to obtain a measure a) over some 
model M and some b G M such that oj{Ri) > a. By Lemma [2Tl uj is smooth. Hence 
by the first paragraph, we can find H^’s and 5 for Cj and R^ as in the statement. The 
same 5 and formulas defining the H^’s will work for almost all factors, contradicting 
the construction. □ 

We can also prove the asymmetric version which is Theorem 3.6 in [CSj . 

Theorem 2.3. Let T he distal and R{x,y) a definable relation. Let fi G (0, i). 
There are a G (0,1) and formulas (j)i{x; zi), (j) 2 iy] Z 2 ) such that: 

For any Keisler measure X{x) and generically stable measure fa{y) both on M, 
there are parameters Cl G andc 2 G with \{(j)i{x] ci)) > a, /i((/) 2 ( 2 /; C 2 )) > 

/3 and the pair (j)i{M-,ci), 4'2(,M\C2) is R-homogeneous. 

Proof. Choose R and fi G {0,^ — e) and fix a pair of measures X{x), y,{y) as in 
the statement. As fj. is smooth, we can apply Fact 11.41 to it, with parameter e. It 
gives us formulas fiifix) and 9f{y). Take i such that X{'tpi{x)) =: a > 0 and set 
(l>i{x) = tpiix). At least one of 9j{y) or -^9l{y) has measure > j3. Let 4’2{y) be 
equal to it. Then the pair {(j)i{x), 4>2{y)) (with hidden parameters from M) is as 
required for the given pair of measures. We conclude by a compactness argument 
as in Theorem 12.21 □ 


3. Equipartitions 

In Remark 5.II of [CSj . it is asked whether in distal theories, one can cut finite 
sets uniformly. We answer this question positively. 
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Definition 3.1. Let <S be a sort. We say that T uniformly cuts finite sets in S if 
for every e > 0, there is a formula x(t;j/) such that for any sufficiently large finite 
set A in 5, and r G [0,1], there is a parameter b for which 


\x{A; &)| 

l“4| 


< e. 


We will say that T uniformly cuts generically stable measures on S if the con¬ 
clusion of Proposition 5.12 in |CS) holds, namely: For every formula (f){x]y) and 
e > 0, there is some x(a;; z) such that for any generically stable measure /x on M 
with /x({c}) = 0 for any singleton c G if 0 < r < /x(0(a;; a)), then we can find 

b G M with |/x(0(a;; a) fl x(ai; b)) — rl < e. 

In the following proof, we write a b for |a — 6| < e. 


Lemma 3.2. Let fJ-{x) be generically stable over M and p G Sx{M) such that 
m({p}) > 0- Then p is generically stable. 


Proof. This can be seen in various ways. Here is an argument which does not 
use any additional fact about generically stable measures. Fix a formula y) 
and e > 0. Set a = p{{p}). By regularity of p seen as a measure on Sx{M), 
there is a formula 0{x) G L{M) such that p h 9{x) and p{9{x)) < (1 -b e)a. Set 
<p'{x] y) = (p{x; y)/\9(x) and without loss, assume that for some bn, (f'{x\ bg) = 9{x). 
Take ai,...,a„ given by Definition 11.11 for /i, cf' and e' := ea. Reordering, there 
is k < n such that at \= 9{x) if and only A i < k. We have \k/n\ = (1 -b S)a for 
S < 2e. Then for any b, p{(l){x;b)) 6)) ^ Av(ai,..., a„; &)) = 

Av(ai,..., Ofe; fi^x; &))“ Av(ai,..., oa,; fi^x; b)). Hence p is generically stable. 

□ 


Proposition 3.3. Let T be NIP and S a sort. The following are equivalent: 

(1) T uniformly cuts finite sets in S; 

(2) T uniformly cuts generically stable measures on S; 

(3) any generically stable type concentrating on S is realized. 

Proof. (3) =b (2): First, let p{x) be a hxed generically stable measure over M with 
/x({c}) = 0 for all c. Then if p{{p}) > 0 for somep G Sx{M), p is generically stable. 
By (3) and the assumption on /x, this does not happen. But then by regularity of 
pL, for any p in the support of /x, we can find a clopen set Up C Sx{M) containing 
p of measure p{Up) < e. By compactness of the support, we can extract a finite 
cover which we can refine to be composed of disjoint sets. Hence we obtain a finite 
partition of x-space into dehnable sets of measures < e. By Corollary If.31 the size 
and the formulas involved in this partition can in fact be chosen uniformly in /x. 
By standard coding techniques, we construct a formula x{x', z) such that any finite 
union of members of this partition is equal to some x(a:;6). Then x(x; z) has the 
required properties (and does not depend on (p{x;y)). 

(2) =^> (1): This is a simple compactness argument. If (1) does not hold, then for 
a given e, there is a sequence A„ of hnite sets, |A„| > n such that for any formula 
x{x] y) , for n large enough, x(a:; y) cannot be used to cut A„ with precision e as 
in Definition o Let pn be the normalized counting measure on A„ and let ^ be a 
non-principal ultraproduct of the /x„’s. Then /x is generically stable and /x({c}) = 0 
for all c. Assumption (2) applied to fix] a) = T gives us a formula x(a^; v) which 
can be used to partition the space in sets of arbitrary /x-measures up to e. But then 
the same x{^'i v) works for almost all the measures /x„ contradicting the assumption. 
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(1) => (3): Assume that (3) fails. Then there is a non-constant totally indis¬ 
cernible sequence I = {at : i < oj) oi elements of S (take a Morley sequence of a 
generically stable type). Then by NIP, for any formula x(x; y) there is N such that 
for any b, either b) or I \ x(/; b) has size < N. Thus taking large finite sets of 
I, we see that (1) cannot hold. □ 

Since distal theories satisfy (3), this gives a proof of [CSI Proposition 5.12] for 
all distal theories (that proposition asserts (2) above assuming distality and (1)). 

We end with a question. It is proved in |CS) that there is no infinite distal field 
of characteristic p. In general, it would be very interesting to understand what are 
the possible obstruction to an NIP theory having a distal expansion. One can also 
ask about expansions that uniformly cut finite sets. 

Question 3.4. Does ACFp have an NIP expansion which uniformly cuts finite 
sets? 
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